LOCUS CEC01G6 2049 bp DNA linear INV 12-NOV-2002

DEFINITION Caenorhabditis elegans cosmid C01G6, complete sequence. 

ACCESSION Z35595 REGION:
join(29625..29756,29826..29976,30144..30261,30306..30571,
30689..30931,31048..31295,31342..31522,31806..32115,32165..32257,
32307..32533,32582..32661)
VERSION Z35595.1 GI:3217055

KEYWORDS HTG; 4-coumarate-coA ligase; Glycerol facilitator protein; Kringle
domain; MIP; NADPH-ferrihemoprotein reductase; Protein tyrosine
receptor ror1; Zinc finger (C3HC4 type).
SOURCE Caenorhabditis elegans 

ORGANISM Caenorhabditis elegans 

Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; 
Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis.
REFERENCE 1 
AUTHORS none. 

TITLE Genome sequence of the nematode C. elegans: a platform for
investigating biology. The C. elegans Sequencing Consortium
JOURNAL Science 282 (5396), 2012-2018 (1998) 
MEDLINE 99069613 

REMARK The C. elegans Sequencing Consortium. 
REFERENCE 2 (bases 1 to 2049) 
AUTHORS Berks, M.
TITLE Direct Submission
JOURNAL Submitted (02-AUG-1994) Nematode Sequencing Project, Sanger
Institute, Hinxton, Cambridge CB10 1SA, England and Department of
Genetics, Washington University, St. Louis, MO 63110, USA. E-mail:
jes@sanger.ac.uk or rw@nematode.wustl.edu
COMMENT On Jun 13, 1998 this sequence version replaced gi:521002.

Coding sequences below are predicted from computer analysis, using 
predictions from Genefinder (P. Green, U. Washington), and other 
available information. 

Current sequence finishing criteria for the C. elegans genome 
sequencing consortium are that all bases are either sequenced 
unambiguously on both strands, or on a single strand with both a 
dye primer and dye terminator reaction, from distinct subclones. 
Exceptions are indicated by an explicit note. 

IMPORTANT: This sequence is not the entire insert of clone C01G6. 
It may be shorter because we only sequence overlapping sections 
once, or longer because we arrange for a small overlap between 
neighbouring submissions. 

The true left end of clone C01G6 is at 1 in this sequence. The true 
right end of clone C01G6 is at 42193 in this sequence. The true 
left end of clone D2013 is at 43257 in this sequence. The start of 
this sequence (1..104) overlaps with the end of sequence AL021147. 
The end of this sequence (43257..43410) overlaps with the start of
sequence Z47808. 



For a graphical representation of this sequence and its analysis
see:- http://wormbase.sanger.ac.uk/perl/ace/elegans/seq/sequence?name=C01G6

IMPORTANT: This sequence is NOT necessarily the entire insert of
the specified clone. It may be shorter because we only sequence
overlapping sections once, or longer because we arrange for a small
overlap between neighbouring submissions.
FEATURES Location/Qualifiers
source 1..2049
/organism="Caenorhabditis elegans"
/db_xref="taxon:6239"
/chromosome="II"
/clone="C01G6"
gene 1..2049
/gene="C01G6.6"
CDS 1..2049
/gene="C01G6.6"
/standard_name="C01G6.6"

/note="contains similarity to Pfam domain: PF00175
(Oxidoreductase FAD/NAD-binding domain), Score=46.6,
E-value=1.8e-10, N=1; PF00667 (FAD binding domain),
Score=59.7, E-value=1.2e-15, N=1
cDNA EST cm06c6 comes from this gene
cDNA EST yk680a9.5 comes from this gene
cDNA EST yk1068e04.3 comes from this gene
cDNA EST yk1068e04.5 comes from this gene
cDNA EST yk1087h12.3 comes from this gene
cDNA EST yk1087h12.5 comes from this gene
cDNA EST yk1285g05.3 comes from this gene
cDNA EST yk1285g05.5 comes from this gene
cDNA EST yk1293e11.3 comes from this gene
cDNA EST yk1293e11.5 comes from this gene
cDNA EST yk1349b10.3 comes from this gene
cDNA EST yk1349b10.5 comes from this gene
cDNA EST yk882b04.3 comes from this gene
cDNA EST yk891f04.3 comes from this gene
cDNA EST yk891f04.5 comes from this gene"

/codon_start=1
/product="Hypothetical protein C01G6.6"
/protein_id="CAA84637.1"
/db_xref="GI:3873859"
/db_xref="SWISS-PROT:Q17574"

/translation="MTDFLIAFGSQTGQAETIAKSLKEKAELIGLTPRLHALDENEKK
FNLNEEKLCAIVVSSTGDGDAPDNCARFVRRINRNSLENEYLKNLDYVLLGLGDSNYS
SYQTIPRKIDKQLTALGANRLFDRAEADDQVGLELEVEPWIEKFFATLASRFDISADK
MNAITESSNLKLNQVKTEEEKKALLQKRIEDEESDDEGRGRVIGIDMLIPEHYDYPEI
SLLKGSQTLSNDENLRVPIAPQPFIVSSVSNRKLPEDTKLEWQNLCKMPGVVTKPFEV
LVVSAEFVTDPFSKKIKTKRMITVDFGDHAAELQYEPGDAIYFCVPNPALEVNFILKR
CGVLDIADQQCELSINPKTEKINAQIPGHVHKITTLRHMFTTCLDIRRAPGRPLIRVL
AESTSDPNEKRRLLELCSAQGMKDFTDFVRTPGLSLADMLFAFPNVKPPVDRLIELLP
RLIPRPYSMSSYENRKARLIYSEMEFPATDGRRHSRKGLATDWLNSLRIGDKVQVLGK
EPARFRLPPLGMTKNSAGKLPLLMVGPGTGVSVFLSFLHFLRKLKQDSPSDFVDVPRV
LFFGCRDSSVDAIYMSELEMFVSEGILTDLIICESEQKGERVQDGLRKYLDKVLPFLT
ASTESKIFICGDAKGMSKDVWQCFSDIVASDQGIPDLEAKKKLMDLKKSDQYIEDVWG"



BASE COUNT 646 a 397 c 447 g 559 t 

ORIGIN 

1 atgacagatt tcctcattgc ttttggctca caaactgggc aagcggaaac tatcgctaaa
61 agtttaaagg aaaaagcaga gttaatcgga ttaacgccaa gacttcacgc tctggatgaa
121 aatgagaaga aattcaattt aaatgaagaa aaactgtgtg caattgtcgt ttcatcaact
181 ggcgatggag atgcaccgga taattgtgca cgtttcgtga gaagaattaa tagaaattca
241 ctggaaaacg aatatttgaa gaatctggat tacgttcttc tcggtctcgg cgattccaac
301 tattcatctt atcaaacgat tcctcgaaaa attgacaaac aactgacagc tctcggtgca
361 aatcgtcttt ttgatcgtgc agaagcagac gatcaagtgg gacttgaact agaagttgaa
421 ccatggatcg agaaattctt tgcaacactg gcatccagat tcgacatttc cgccgacaaa
481 atgaatgcaa ttacggaatc atcaaacttg aaattgaatc aagttaagac tgaagaagaa
541 aagaaagcgt tacttcagaa acgaatagaa gatgaagaat cagatgatga aggaagagga
601 agagttattg gaattgatat gttgattccc gagcattatg attatcccga aatttctctt
661 ttaaaaggat cgcaaaccct ctcgaacgat gaaaatttgc gcgttccaat tgctcctcaa
721 cctttcatcg tctcgtcggt ttctaatcga aaactaccag aagataccaa attggaatgg
781 caaaacctgt gcaaaatgcc tggagtcgtc acgaaaccat tcgaagttct tgtagtctct
841 gctgaatttg tgactgatcc attttccaag aaaattaaaa cgaaacggat gatcactgtc
901 gatttcggag atcatgctgc tgaacttcaa tacgagccgg gagatgccat atatttttgt
961 gttccaaatc cagcacttga agtcaatttt atccttaaaa gatgcggagt tttagacatt
1021 gctgatcaac aatgcgagct ttcaattaac ccgaaaactg agaaaatcaa tgctcaaatt
1081 ccgggacacg tgcacaaaat tacaactctc cgtcatatgt tcaccacgtg tctagacatt
1141 cgtcgggctc caggaagacc gctgattcgt gtactcgccg aaagtacatc tgatccaaat
1201 gagaaaagac ggcttttgga gctttgcagt gctcagggaa tgaaggattt caccgatttt
1261 gttagaacac ccggattgtc attggctgac atgcttttcg catttccaaa tgtgaagcca
1321 ccggttgata gactaatcga acttcttcca cggcttatcc cacgtccgta cagtatgtca
1381 tcatatgaaa accgaaaagc ccgtctgatc tattcggaaa tggaattccc agcaacagat
1441 ggacgtcgac attcgagaaa aggactcgca actgattggc tcaattcttt gagaattgga
1501 gataaagttc aagttttggg aaaggaaccg gctagattca gacttccacc tcttggaatg
1561 acaaagaatt cggctggaaa actgccatta ttgatggttg gaccaggaac tggagtctcg
1621 gtatttctgt cattcctgca ttttttgaga aaattgaaac aagactctcc atcggacttt
1681 gtcgatgttc cacgtgttct cttctttgga tgtcgtgatt cttctgtaga tgcaatttat
1741 atgagcgaac ttgaaatgtt tgtaagcgaa ggcatcctca ctgatctaat catctgtgaa
1801 agcgagcaaa agggtgaaag agttcaagat ggcttaagaa aatatctcga caaggttctt
1861 ccatttttga cagcatcgac agagtcgaaa attttcatct gtggagacgc aaaaggaatg
1921 tctaaagatg tttggcaatg tttctcggat attgttgcca gtgatcaagg aattccggac
1981 ttggaagcga agaagaagct tatggatctg aaaaagagtg atcaatacat cgaggatgtg
2041 tggggatag



// 



Revised: July 5, 2002. 






